Meetings in Physics at University of Sofia, vol. 3 
ed. A. Proykova, Heron Press, Sofia, 2001 



Optimization of molecular clusters configurations using a Genetic Algorithm 



Stoyan Pisov and Ana Proykova 

Department of Atomic Physics, Faculty of Physics, University of Sofia, Sofia-1126, 5 
J. Bourchier Blvd. 



Abstract. We present a genetic algorithm developed (GA) to optimize molecular 
AFq cluster configurations with respect to their energy. The method is based on 
the Darvin's evolutionary theory: structures with lowest energies survive in a system 
of fixed number of clusters. Two existing structures from a given population are 
combined in a special way to produce a new structure (child) which is kept if its energy 
is lower than the highest energy in the ensemble. To keep the population constant 
we reject the structure with the highest energy. This algorithm gives a better result 
than the optimization techniques used previously. Using the GA we have found a new 
structure corresponding to the (seemingly) global minimum. The most important 
result is that the new structure is detected only if the molecular cluster contains 
more than a critical number of molecules. 



1. Introduction 

Optimized structures give a detailed information 
about symmetry, phase transitions and we also can 
use them to calculate density of states of the studied 
substance. Every optimization algorithm tries to find 
the configuration with the lowest energy. Usually it 
minimizes the potential energy performing consecu- 
tive steps from one to another configuration by in- 
specting the local minima on the potential energy 
surface (PES). Molecular clusters made of octahe- 
dral molecules (AF 6 ,A = S, Se,Te,U) have rugged 
potential energy surface jjj . Attempts to use simulat- 
ing annealing [|2| to find the global energy minimum 
in the systems often fail due to high-energy barriers 
(Fig. 7 in ||), which trap the simulated system in 
one of the innumerable metastable configurations. 
A cartoon can be seen in Fig. ^. On such a sur- 
face, techniques like simulated annealing, quenching 
or the conjugate gradient method find the local 
"global" (glocal) minimum , which might lie higher 
than the true global minimum if it is localed in an- 



other basin ||. Hence we need an algorithm, which 
would permit "jumps" from one basin to another 
and sample properly the phase space. Various tech- 
niques of global optimization have been proposed: 
basin hopping j^], genetic algorithm [^|, adiabatic 
switching Q. 

In the present work we further elaborate GA origi- 
nally developed of Deaven and Ho for atomic systems 
in order to make it applicable to molecular clusters. 
We implement the new algorithm for configuration 
optimization of AFq clusters, simulated with molec- 
ular dynamics jlj. A new structure has been found 
that has never been seen in our previous investiga- 
tions [|, |. 

2. A genetic algorithm for molecular 
clusters 

In this section we describe our genetic algorithm in 
detail. Each molecule is defined with a pair of co- 
ordinates {x, q} = X, x = {x,y, z} is the Carte- 
sian coordinate of molecular center of mass, q = 
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Figure 1. Schematic representation of Potential En- 
ergy Surface (PES). 

{qo, qx, (72, 93} is molecular orientation in quaternion 
representation. We denote a cluster configuration 
with N molecules with: 

® = {X 1 ,X 2 ,...,X N } 

The genetic algorithm uses a population of n struc- 
tures {0}, n is kept constant during the optimization 
run. We define a mapping operator P : P(0, 0') — > 
0", which performs the following action upon two 
parent geometries 25 and 25' to produce a child 0". 
First we select parents from the population using 
the distribution Eqn.(^). Second, we choose planes 
that account for the parent clusters packing symme- 
try and pass through the center of mass of the par- 
ents. Then we cut the clusters in the chosen planes. 
We would like to underline that the choice of cutting 
planes is crucial for the proper work of the algorithm. 
In other words, it is very important to find out the 
packing symmetry of all clusters in the populations. 

In the case of solid molecular clusters at a low tem- 
perature, the centers of the molecules hardly move 
but their orientations do. Hence, in the present ver- 
sion of Genetic Algorithm, we match centers of the 
parent clusters before searching for a suitable plane. 
After cutting the parents, we assemble the child 0" 
from the molecules of which lie above the plane 



and the molecules of 25' which lie below the plane. 
If the child generated in this manner does not con- 
tain the correct number of molecules, we translate 
the plane until the child 25" contains the correct 
number of molecules. Relaxation to the nearest lo- 
cal minimum is performed with a conjugate gradient 
minimization j^, R). 

We preferentially select parents with a lower en- 
ergy from {25}. The probability p(25) of an individual 
candidate 25 to be selecting for mating is given by 
the Boltzmann distribution. 

p(25) oc exp[-E{e)/K h T m ] (1) 

where E(<8) is the energy of the candidate 25, Kb 
is the Boltzmann constant and T m is the mating 
"temperature", chosen to be roughly equal to the 
range of energies in {25}. For a better performance 
we can apply mutations to some members (p) of 
the population. The mutation operator is defined as 
M : Af(25) — > 25' which performs two random ac- 
tions with the same probability. First, M moves the 
coordinates of mass centers in a random direction 
with a random step. Second, M rotates the chosen 
molecule at a random angle. Such mutation can be 
applied to some molecules in a cluster or to all of 
them. 

We create subsequent generations as follows. Par- 
ents are continuously chosen from {0} with a prob- 
ability given by Eqn.(Q) and mated using the mat- 
ing procedure described above. The fraction /i of the 
children generated in this way are mutated; fi = 
means no mutation occurs. The (possibly mutated) 
child is relaxed to the nearest local minimum and re- 
placed with a configuration with a higher energy in 
population {0} if its energy is lower than the higher 
energy. This algorithm requires a great number of 
members (n) in the population in order to prevent a 
rapid convergence to a set of identical candidates. 

3. Results for TeF§ clusters 

To illustrate the method, we used configurations for 
TeFe clusters obtained in MD simulations described 
elsewhere Q|. 



Short title 



3 



B -21.72 
a. 

-21.74 



■21.78 ■ 
-21.80 




Figure 2. A population of n = 100 clusters each con- 
taining 89 TeFg molecules finds the configuration 
with the lowest energy about 160 iteration steps. 



Fig. g shows the minimum energy found in an op- 
timization of n=100 configurations (starting popula- 
tion) of 89-molecule TcFq clusters. During the first 
100 iterations, the algorithm effectively and rapidly 
creates better children. Then the process is slowed 
after the 100 th step. An important comment is that 
we start with a population of structures already op- 
timized with conjugate gradient or simulated anneal- 
ing methods. We underline that both techniques give 
the same optimized structures. 

In the case of 89 and 137 TeF$ clusters we have 
found a new local minimum on the PES which cor- 
responds to quite a different structure with respect 
to the orientational order. Fig. || represents these 
orientational structures produced with two different 
minimization algorithms, e.g. the conjugate gradient 
and genetic algorithm. 

We have found that in the case of TeF$ clusters 
with 27 and 59 molecules such an orientation distri- 
bution can not be found as it is seen in Fig. [|. This 
is a pronounced size effect. For the case of 89 TeF§ 
clusters we perform a sequential molecular dynamic 
run to check if this new structure is stable. Start- 
ing from a low temperature « 0.5 K, we increased 
the temperature up to 30 K and the structure is still 
stable. 



Figure 3. Distribution of mutual orientations of ev- 
ery pair of molecules in the clusters. The genetic 
algorithm finds a unique structure (the peak at 
cos(9) w 0.66 is missing) if the number of the 
molecules in a cluster is greater than 80. 





Figure 4. The distributions of the mutual orienta- 
tions of molecules show that smaller clusters (N < 
59) cannot pack in the new structure. 



4. Conclusion 

Although the topography of the studied PES is 
very complicated, the newly developed algorithm has 
shown a great ability to find the "glocal" minimum. 
Mutations often boost this ability but in some cases 
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it becomes worst. One can optimize the mutation pa- 
rameters (percentage of mutated-cluster, mutation 
operator M, etc.) to obtain better results ]l0| . Fi- 
nally, we have found that the genetic algorithm can 
used to " clean" defects in structures optimized with 
techniques. For instance, the free molecular clusters 
have many surface molecules oriented improperly in 
comparison to the others even in the lowest energy 
configuration found with any other method. Such a 
procedure (cleaning) is very important if the density 
of states is needed. The general shortcoming of the 
method is its slowness, which makes its application 
limited. 
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